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Abstract. A hyperbolic conjugacy class in the modular group PSL(2, Z) cor- 
responds to a closed geodesic in the modular orbifold. Some of these geodesies 
virtually bound immersed surfaces, and some do not; the distinction is re- 
lated to the polyhedral structure in the unit ball of the stable commutator 
length norm. We prove the following stability theorem: for every hyperbolic 
element of the modular group, the product of this element with a sufficiently 
large power of a parabolic element is represented by a geodesic that virtually 
bounds an immersed surface. 



1. Introduction 

In many areas of geometry, it is important to understand which immersed curves 
on a surface bound immersed subsurfaces. Such questions arise (for example) 
in topology, complex analysis, contact geometry and string theory. In [5] it was 
shown that studying isometric immersions between hyperbolic surfaces with geo- 
desic boundary gives insight into the polyhedral structure of the second bounded 
cohomology of a free group (through its connection to the stable commutator length 
norm, defined in § 12. 2} . 

If E is a (complete) noncompact oriented hyperbolic orbifold, a (hyperbolic) con- 
jugacy class g in 7Ti(E) is represented by a unique geodesic 7 on E. The immersion 
problem asks when there is an oriented surface S and an orientation-preserving im- 
mersion i : S — > E taking dS to 7 in an orientation-preserving way. If the problem 
has a positive solution one says that 7 bounds an immersed surface. The immersion 
problem is complicated by the fact that there are examples of curves 7 that do not 
bound immersed surfaces, but have finite (possibly disconnected) covers that do 
bound; i.e. there is an immersion i : S — > E as above for which dS — > 7 factors 
through a covering map of some positive degree. In this case we say 7 virtually 
bounds an immersed surface. 

One can extend the virtual immersion problem in a natural way to finite rational 
formal sums of geodesies representing zero in (rational) homology. Formally, one 
defines the real vector space B^ (G) of homogenized 1-boundaries, (see § !2.2p . where 
G = 7Ti(E). It is shown in [5] that the set of rational chains in B^ for which the 
virtual immersion problem has a positive solution are precisely the set of rational 
points in a closed convex rational polyhedral cone with nonempty interior. This 
fact is connected in a deep way with Thurston's characterization (see [15]) of the 
set of classes in H%{M\ Z) represented by fibers of fibrations of a given 3-manifold 
M over S . It can be used to give a new proof of symplectic rigidity theorems 
of Burger- Iozzi-Wienhard [3], and has been used by Wilton [16] in his work on 
Casson's conjecture. Evidently understanding the structure of the set of solutions 
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of the virtual immersion problem is important, with potential applications to many 
areas of mathematics. Unfortunately, this set is apparently very complicated, even 
for very simple surfaces S. 

The purpose of this paper is to prove the following Stability Theorem: 

Stability Theorem 13.11 Let v be any hyperbolic conjugacy class in PSL(2,Z), 
represented by a string X of positive R 's and L 's. Then for all sufficiently large 
n, the geodesic in the modular orbifold A4 corresponding to the stabilization R n X 
virtually bounds an immersed surface in M.. 

This theorem proves the natural analogue of Conjecture 3.16 from 5 , with 
PSL(2, Z) in place of the free group F2. 

It follows from the main theorems of [5] that the elements v n £ PSL(2, Z) corre- 
sponding to the stabilizations of v as above satisfy scl(i>„) = rot(t? n )/2, where rot 
is the rotation quasimorphism on PSL(2,Z), and scl denotes the stable commutator 
length (see §[5]for details). Under the natural central extension <f> : B3 — s- PSL(2, Z) 
where B3 denotes the 3 strand braid group, there is an equality scl(6) = scl(0(6)) 
for all b € [B 3 , 2? 3 ]; consequently we derive an analogous stability theorem for stable 
commutator length in B%. 

We give the necessary background and motivation in § [2j Theorem 13. II is proved 
in § [3l In §[4] we generalize our main theorem to (2,p, oo)-orbifolds for any p > 3, 
and discuss some related combinatorial problems and a connection to a problem 
in theoretical computer science. Finally, in § [5] we describe the results of some 
computer experiments. 

2. Background 

We recall here some standard definitions and facts for the convenience of the 
reader. 

2.1. The modular group. The modular group PSL(2, Z) acts discretely and with 
finite covolume on H 2 , and the quotient is the modular orbifold VW, which can be 
thought of as a triangle orbifold of type (2, 3, 00). 

Every element of PSL(2, Z) either has finite order, or is parabolic, or is conjugate 
to a product of the form R ai L bl R a2 ■ ■ ■ L°" m where the ai and bi are positive integers, 
and where L and R are represented by the matrices 

?) r -(i : 

(the parabolic elements are conjugate into the form R a or L b ). 

The group PSL(2,Z) is abstractly isomorphic to the free product Z/2Z * Z/3Z. 

2.2. Stable commutator length. For a basic introduction to stable commutator 
length, see 0, especially Chapters 2 and 4. 

If G is a group, and g € [G, G], the commutator length of g (denoted cl(g)) 
is the smallest number of commutators in G whose product is g, and the stable 
commutator length scl((?) is the limit scl((?) := lim„_ ) . 00 c\(g n )/n. 

Stable commutator length extends in a natural way to a function on B\{G), 
the space of real group 1-boundaries (i.e. real group 1-chains representing in 
homology with real coefficients) and descends to a pseudo-norm (which can be 
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thought of as a kind of relative Gromov-Thurston norm) on the quotient B^ (G) := 
B 1 (G)/(g-hgh- 1 ,g n -ng). 

The dual of this space (up to scaling by a factor of 2) is Q(G)/H 1 (G) — i.e. 
homogeneous quasimorphisms on G modulo homomorphisms, with the defect norm. 
This duality theorem — known as Generalized Bavard Duality — is proved in [6], 
§ 2.6. A special case of this theorem, proved by Bavard in [2], says that for any 
g £ [G, G] there is an equality scl(<?) = sup^ 4>(g)/2 where the supremum is taken 
over all homogeneous quasimorphisms <fi £ Q{G) normalized to have defect D((f>) 
equal to 1. A quasimorphism cf> (with D((f>) — 1) is extremal for g if scl(g) = <j){g)/2. 

The theory of stable commutator length has deep connections to dynamics, group 
theory, geometry, and topology; however, although in principle this function con- 
tains a great deal of information, it is notoriously difficult to extract this informa- 
tion, and to interpret it geometrically. It is a fundamental question in any group 
G to calculate scl on chains in B^ (G), and to determine extremal quasimorphisms 
for such elements. Conversely, given a homogeneous quasimorphism <fi £ Q(G) 
(especially one with some geometric "meaning"), it is a fundamental question to 
determine the (possibly empty) cone in B^ (G) on which </> is extremal. 

2.3. Rotation quasimorphism. If G is a word-hyperbolic group, scl is a genuine 
norm on B^ (G). Moreover, it is shown in [4] and [5] that if G is virtually free, the 
unit ball in this norm is a rational polyhedron, and there are codimension one faces 
associated to realizations of G as the fundamental group of a complete oriented 
hyperbolic orbifold. 

Dual to each such codimension one face is a unique extremal vertex of the unit 
ball in Q/H 1 . In our case, PSL(2,Z) may be naturally identified with the funda- 
mental group of the modular orbifold. The unique homogeneous quasimorphism 
dual to this realization, scaled to have defect 1, is the rotation quasimorphism, 
denoted rot. This rotation function is very closely related to the Rademacher ip 
function, which arises in connection with Dedekind's r\ function, and is studied by 
many authors, e.g. [TJ [101 HI HH [H]> and so on (see especially [TO], § 3.2 for a 
discussion most closely connected to the point of view of this paper). In fact, up to 
a constant, the rotation quasimorphism is the homogenization of the Rademacher 
function; i.e. rot(g) = linin^oo ip(g n )/6n. 

The simplest way to define this function (at least on hyperbolic elements of 
PSL(2, Z)) is as follows. Associated to a hyperbolic conjugacy class g £ PSL(2, Z) 
is a geodesic 7 on X. The geodesic 7 cuts M. up into complementary regions 
Ri (see Figure [TJ for an example) . Join each region Ri to the cusp by a proper 
ray on, and define to be the signed intersection number ni = cti fl 7. Then 
rot(.g) = y~ J2i n i area(-Rj). In other words, up to a factor of 27r, the rotation 
number is the algebraic area enclosed by 7. Algebraically, if the conjugacy class 
of g has a factorization of the form R ai L bl ■ ■ ■ L bn then rot(g) = |(X] a i 
see [12] 1.5-6, or [T3] equation 70 (Rademacher denotes (p and 6 rot by $ and * 
respectively). 

By Bavard duality, one has scl(g) > rot(g)/2 for every g. Moreover, it is shown 
in [S] (for arbitrary free groups, though the proof easily generalizes to virtually free 
groups) that equality is achieved if and only if the geodesic representative 7 of g 
virtually bounds an immersed surface. That is, if and only if there is a hyperbolic 
surface S and an isometric immersion S — ¥ A4 wrapping dS some (positive) number 
n times around 7. Topologically, one can think of the problem of constructing such 
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Figure 1. 7 cuts a fundamental domain into regions Ri 

an immersed surface as a kind of jigsaw puzzle: one takes n ■ rii copies of each region 
Ri, where n, rii and Ri are as above, and tries to glue them up compatibly with 
their tautological embeddings in M. in such a way as to produce a smooth orbifold 
with geodesic boundary. Evidently, a necessary condition is that the n,; are all 
non-negative. However, this necessary condition is not sufficient. 

In [5] it was observed experimentally that for many words w £ [F2 , F2] , geodesies 
on a hyperbolic once-punctured torus corresponding to conjugacy classes of the 
form [a, b] n w all virtually bound immersed surfaces for sufficiently large n, and it 
was conjectured (Conjecture 3.16) that this holds in general. Our main theorem 
(Theorem 13.11 below) proves the natural analogue of this conjecture with the free 
group F2 replaced by the virtually free group PSL(2, Z). 

2.4. Braid group. The braid group B3 is a central extension of PSL(2, Z). Under 
this projection, the standard braid generators o\ and 02 are taken to R^ 1 and L 
respectively. It is straightforward to show that for any b € [B3,Bs] the (stable) 
commutator length of b is equal to the (stable) commutator length of its image in 
PSL(2, Z). Consequently, our main theorem shows that the rotation quasimorphism 
is extremal for a sufficiently large stabilization of any element of [B3, B3]. 

3. Proof of theorem 

The purpose of this section is to prove the following theorem: 

Theorem 3.1 (Stability theorem). Let v be any hyperbolic conjugacy class in 
PSL(2,Z), represented by a string X of positive R's and L's. Then for all suf- 
ficiently large n, the geodesic in the modular orbifold M. corresponding to the sta- 
bilization R n X virtually bounds an immersed surface in M. 

The proof will occupy the remainder of the section. 

The conjugacy class of v has a representative of the form R ai L bl R a2 ■ ■ ■ L bn where 
the a%, bi are all positive integers. We will show that the geodesic 7 corresponding 
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to v virtually bounds an immersed surface providing a\ is sufficiently big compared 
to Y^ijti a i + J2i bi- Evidently the theorem follows from this. We fix the notation 
N = a\ and N' = Y] { « 1 ai + X)i ^* m * ne sec l uei ! an d we prove the theorem under the 
hypothesis N > 37V' + lln + 3 (note that there is no suggestion that this inequality 
is sharp). 

In the modular orbifold Ai , let a denote the embedded geodesic segment running 
between the orbifold points of orders 2 and 3. The preimage of a in the universal 
cover H 2 is a regular 3-valent tree, which we denote a; see Figure [2] 



Figure 2. The tree a in the upper half-space model 



In the upper half-space model of H 2 , let W be the closure of the complementary 
component of H 2 — a stabilized by the translation z — > z+1. Then dW is a collection 
of circular arcs, whose vertices are the complex numbers e 2 ™/ 3 + n for n € Z. We 
call these arcs the segments of dW . Every segment of dW (orbifold) double covers 
the interval a in Ai. In the sequel we use the abbreviation ui = e 



2ni/3 



3.1. Arcs and subwords. The arc a cuts 7 into a collection of geodesic segments 
which correspond approximately to the R ai and L bi terms in the expression of v, 
in the following way. 

After choosing a base point and an orientation, a word v in the L's and i?'s 
determines a simplicial path in a, where L indicates a "left turn" , and R indicates 
a "right turn" , reading the word from right to left. The letters R or L correspond to 
the vertices of this path, and a string of the form RL b R (resp. LR a L) corresponds 
to a segment of length 6+1 (resp. a + 1) which, after translation by some element 
of PSL(2, Z), we can arrange to be contained in dW as a string of b+ 1 consecutive 
arcs moving to the right (resp. a + 1 consecutive arcs moving to the left). 

The bi-infinite power v :— ■ ■ ■ vvv ■ ■ ■ determines a bi-infinite path P(v) in a, 
which is a quasigeodesic in H 2 a bounded distance from the geodesic representative 
of an axis of (some conjugate of) v. We may thus crudely associate lifts of segments 
7j to W with such subwords. 

If we translate P{v) so that the segment corresponding to L b starts at the vertex 
ui of a, then this segment ends at ui + b+1. Moreover, the endpoints of P(v) on the 
real axis are contained in the intervals (—1, 0) and (b, b + 1). Let 7 be the infinite 
geodesic with the same endpoints as P(v). Then the intersection of 7 with W is 
either empty (for which b = 1 is necessary but not sufficient) or consists of two 
points, one in the segment of a from ui to ui ± 1, the other in the segment of a from 
ui + b to oj + b±l where the ±1 depends in each case on the rest of the word v (the 
degenerate case that 7 passes through one or two vertices of a is allowed). 

In our case of interest, this intersection 7 n W projects to the segment of 7 
corresponding to an L bi subword. Similarly, ai segments of 7 correspond to R ai 
subwords, with the caveat that some L bi or R ai subwords with ai or bi — 1 may 
not correspond to a segment of 7 at all. 



6 



DANNY CALEGARI AND JOEL LOUWSMA 



Example 3.2 (R 7 L 2 RL part 1). We will illustrate the main points of our construc- 
tion in a particular case. 

P(v) 




FIGURE 3. A bi-infinite path P(v) associated to v — R 7 L 2 RL. At 
each vertex, P turns left or right according to the letters of v (read 
from right to left). The path P(v) is chosen so that a segment 
corresponding to R 7 is contained in dW . 

Let v correspond to the conjugacy class R 7 L 2 RL which satisfies scl(w) = 5/12 
and rot(w) = 5/6. A matrix representative in PSL(2, Z) for v is ( 3 5 7 22 ). A bi-infinitc 
path P(v) is illustrated in Figure [3] and the corresponding axis 7 in Figure [4] 




prefix: RL L suffix: L LR 

Figure 4. An axis 7 obtained by straightening P(v). The "bar- 
rier" circles are associated to the bi-infinite words RLR 7 LR and 
LR 7 L. The part of 7 contained in W is a lift of ot\. 



The geodesic 7 is cut by a into three segments ot\, ft, a.2, corresponding to the 
subwords R 7 , L 2 and R. 

3.2. Lifts and surfaces. For each a>i or ft we choose lifts on and ft properly 
contained in W subject to the following lifting conditions: 

(1) the lifts are disjoint, and no two lifts intersect the same segment of dW 

(2) there are exactly five consecutive segments of dW between consecutive ft 

(3) the 5i and ft are not nested (i.e. they cobound disjoint disks with dW) 
except that the ft are all contained "under" the lift Si, so that the left- 
most vertex of 5i and the leftmost vertex of ft intersect segments of dW 
separated by exactly five other segments of dW . 

For each i ^ 1 let ft be the subsurface of W bounded by on and dW . Let 
T be the subsurface of W "above" all the ft and "below" Si. We will build our 
immersed surface from the ft and T, glued up suitably along their intersection 
with dW. Let S denote the (disjoint) union S = UiSi U T. The boundary of S 
comes in two parts: the part of the boundary along the on and ft (we denote this 
part of the boundary d y S) and the part along dW (we denote this part by dwS). 
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Furthermore, dwS decomposes naturally into segments which are the intersection 
with the segments of dW. There are two kinds of such segments: entire segments 
(those corresponding to an entire segment of dW contained in dS) 7 and partial 
segments (those corresponding to a segment of dW containing an endpoint of some 
cti or fa in its interior). We also allow the case of a degenerate partial segment, 
consisting of a single vertex of dW n Si or dW H Pi . 

By construction, the partial segments of dwS come in oppositely oriented pairs, 
ending on pairs of points of d-yS fl d\yS projecting to the same point in 7. We 
glue up such pairs of partial segments, producing a surface S'. Under the covering 
projection H 2 — > At, the surface S immerses in Ai in such a way that the immersion 
extends to an immersion of 5". The d 1 components glue up to produce a smooth 
boundary component 9 7 <S" which wraps once around 7 in Ai. The other part of dS' , 
which by abuse of notation we denote &wS', is a union of connected components, 
each of which is tiled by entire segments of dW . At each vertex corresponding to 
an end vertex of a partial segment, the segments of &wS' meet at an angle of 47r/3. 
At every other vertex the segments meet at an angle of 27r/3. We say such vertices 
are of type 2 and type 1 respectively. 

To complete the construction of an immersed surface virtually bounding 7 (and 
thereby completing the proof of Theorem l3.ip we must show how to glue up S' by 
identifying segments of dw to produce a smooth surface. Such a surface immerses 
in Ai, and is extremal for 7. In fact, technically it is easier to glue up S' to 
produce a smooth orbifold, containing orbifold points of order 2 and 3 that map to 
the corresponding orbifold points in Ai. Such an orbifold is finitely covered by a 
smooth surface virtually bounding 7. 



Example 3.3 (R 7 L 2 RL part 2). With notation as in part 1, we choose lifts Si, Pi 
and S2 as indicated in Figure [5] 

5i 




Figure 5. Lifts a±, Pi and a.2- The region T is between a\ and 
Pi , and the region S% is under S2 . The endpoints of the on and Pi 
are glued up by gluing ai to 02, 61 to 62, and ci to C2. 

Notice that the exponent 7 is too small: there is not enough room for P\ under 
Si without putting two endpoints (ci and C2) on adjacent partial segments. The 
result of gluing up these adjacent segments produces an orbifold point of order 3 
in the interior of S' . 

Furthermore, there are a pair of "degenerate" partial segments, consisting of the 
points ai and 02. Identifying these points produces a non-manifold point in S', 
where the two components 9 7 S" and dwS' meet, at a smooth point on <9 7 S", and 
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at a point with angle 47r/3 (i.e. a point of type 2) on d\yS' . This non-manifold 
point will become an ordinary manifold point after we glue up dwS' . 

The at and the endpoints of the partial segments containing the bi glue up into 
two adjacent type 2 points on d\yS' , and the remaining three vertices of d\yS give 
rise to three adjacent type 1 points on dwS'. Thus the vertices on dy/S' are of 
type 11221 in cyclic order. 

The interior of the 12 segment can be folded up, creating an orbifold point of 
order 2, and the other four segments identified in pairs (pairing 11 with 22), creating 
another orbifold point of order 3 corresponding to the "unpaired" 1 vertex. 

The result is a smooth orbifold S" with three orbifold points of orders 2, 3, 3, 
which immerses in M. with boundary wrapping once around 7. A finite (orbifold) 
cover of S" is a genuine surface, which 7 virtually bounds. 

3.3. Combinatorics. We have seen in general that dwS 1 is determined by combi- 
natorial data consisting of a finite collection of circularly ordered sequences of l's 
and 2's, which we call circles. We write such a circle as an ordered list of l's and 
2's, where two such ordered lists define the same circle (denoted ~) if they differ 
by a cyclic permutation. Hence 211 ~ 121 ~ 112 and so on. A consecutive string 
of l's and 2's contained in a circle is bracketed by a dot on either side; hence T2- 
is a sequence in the circle 122 and so on. 

The 2's correspond to vertices of dwS which are also vertices of dW, and are 
contained in partial edges; whereas the l's correspond to the other vertices of dwS 
which are also vertices of dW . Hence the total number of 2's is equal to the 
number of segments of 7 — a, which is at most n. The total number of l's is at 
most N + N' + n. 

The only combinatorial properties of the circles we need are the following: 

(1) each circle contains at least one 2, and consequently there are at most n 
circles (this is immediate from the construction); 

(2) some circle contains a sequence of at least N — N' — In consecutive l's, 
where N is large compared to N' and n (this follows from lifting conditions 
(2) and (3)); and 

(3) each circle contains a string of at least two consecutive l's (this follows from 
lifting condition (2)). 

We refer to the sequence of at least N — N' — 7n consecutive l's informally 
as the big 1 sequence. Providing N is sufficiently big compared to N' and n — 
equivalently, providing the big 1 sequence is sufficiently long — we can completely 
glue up dwS' as an orbifold. We now explain how to do this. 

The argument consists of a sequence of reductions to simpler and simpler com- 
binatorial configurations. These reductions are described using a pictorial calculus 
whose meaning should be self-evident. 

The first reduction consists of taking a pair of 1 l's and identifying the segments 
they bound. If the ll's are on different circles, these two circles become amalga- 
mated into a single circle. We call this the 1 -handle move; see Figure [Sj 

Hence, by bullets (2) and (3), by applying the 1-handle move at most n times, 
using up at most 2n of the l's in the big 1 sequence in the process, we can reduce 
to the case of a single circle. This circle has the form 1"V where l m is the big 1 
sequence, and v stands for some sequence of l's and 2's. Since N > 3N' + lln + 3, 
the big 1 sequence in dS' has length at least 2N' + 6n + 3. Using up at most 2n of 
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Figure 6. The 1-handle move 



these l's gives m > 2N' + 4n + 3. On the other hand, the length of v is at most 
N' + 2n by construction. 

We introduce three other simple moves: 

(1) a single TT is folded up, producing an interior orbifold point of order 2 
and a single 2 vertex; i.e. -11- — > -2-; or 

(2) the two adjacent -11- segments in a Til- are identified, producing an interior 
orbifold point of order 3 and a single 2 vertex; i.e. -Ill- — > -2-; or 

(3) the middle -12- segment of a -1121- is folded up, producing an interior 
orbifold point of order 2 and a single 2 vertex; i.e. -1121- — > -2- (also 
•1211- -> -2-). 

The first two moves are special cases of the 1-handle move, where the two TT 
segments being glued are not disjoint. By means of repeated applications of the 
•11- — > -2- move, a string of at most 2k consecutive l's can be reduced to any string 
of length k. 

Associated to any sequence of l's and 2's is its complement, obtained by reversing 
the order of the sequence and replacing each 1 by a 2 and conversely. We denote 
the complement of a sequence v by v c . We transform a subset of the big 1 sequence 
into the complement of v by such -11- — > -2- moves. This gives the reduction 
1"V — > l m v c v where m' > 3. 

The v sequence and its complement can be glued up in an obvious way, folding 
up the segment between the last letter of v c and the first letter of v and producing 
an interior orbifold point of order 2. This amounts to the reduction l m v c v — > l™ 1 2 
where m" > 1. After a finite sequence of -1121- — > -2- moves, we reduce to one of 
the cases 12, 112 or 1112. 

Folding up each edge of the 12 circle glues up the boundary completely, producing 
two interior orbifold points of order 2. Folding up the -12- edge and identifying the 
other pair of edges in the 112 circle glues up the boundary completely, producing 
two interior orbifold points or orders 2 and 3. Identifying the -12- and -21- edges 
with the succeeding pair of -11- edges of the 1112 circle glues up the boundary 
completely, producing two interior orbifold points of order 3. See Figure [71 



fold 




Figure 7. Gluing up 12, 112 and 1112 by folding 
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In every case therefore dwS' can be completely glued up, and the proof of 
Theorem 13. II is complete. 

Remark 3.4. The proof generalizes in an obvious way to formal sums. Let j n denote 
the geodesic corresponding to the conjugacy class R n X for some fixed string X. 
Let Si,-" , 5 m be any finite collection of geodesies in M.. Then for sufficiently large 
n, the 1-manifold j n Uj Si virtually bounds an immersed surface in M.. 

4. Generalizations 

In this section we discuss some generalizations of our main theorem. 

4.1. (2,p, oo)-orbifold. The purpose of this section is to give a proof of the fol- 
lowing theorem which generalizes Theorem 13. II to hyperbolic (2,p, oo)-orbifolds for 
any p > 3. 

Theorem 4.1 ((2,p, oo)-stability). Let M. p denote the hyperbolic (2, p, oo)- orbifold 
for p > 3, and let G p be its fundamental group. Let R € G p be the element 
corresponding to a positive loop around the puncture. Let v € G p be arbitrary. Then 
for all sufficiently large n, the geodesic in M. p corresponding to the stabilization R n v 
virtually bounds an immersed surface in M. p (providing this element is hyperbolic) . 

Remark 4.2. For any v, the elements R n v are eventually hyperbolic unless v is a 
power of R. 

Proof. The proof is very similar to the proof of Theorem 13.11 and for the sake of 
brevity we use language and notation as in § [3] by analogy. 

There is an embedded geodesic segment a p running between the orbifold points 
of orders 2 and p in M. p , covered by an infinite p-valent tree a p in H 2 . Let W p be 
the closure of the complementary component of H 2 — a p stabilized by a translation. 
Then dW p is a sequence of circular arcs meeting at an angle of 2ir/p. 

A conjugacy class R n v represented by a geodesic 7 is decomposed into a.i and f3i 
arcs by o~ p , where we use a, to denote the arcs whose lifts to W p move to the left, and 
the /3i to denote the arcs whose lifts to W p move to the right. For sufficiently large 
n, there is one arc a± with a lift 5i which intersects segments of dW p approximately 
n apart, whereas the combinatorial types of the ai (for i > 1) and fii are eventually 
constant. 

As in § 13.21 we choose disjoint lifts ai , f3i with the (3i under Si and with five 
segments of dW p between successive I3i. These lifts cobound surfaces T and Si 
which can be glued up to produce S' with dwS 1 a collection of circles with vertices 
labeled by numbers from 1 to (p — 1). As in § 13.21 we are guaranteed that each circle 
contains a -ll- , and one circle contains a big 1 sequence (which may be assumed to 
be as long as we like, by making n large). 

The 1-handle move still makes sense, so after performing a sequence of such 
moves we can reduce to the case of a single circle with a big 1 sequence; i.e. we 
have reduced to the case of 1"V for v an arbitrary (but fixed independent of all 
large n) sequence, and m as big as we like. Folding a -Ik- segment where k < p — 1 
produces an interior orbifold point of order 2, and a single (fc + 1) vertex. So we 
can turn a sequence of at most (p — consecutive l's into the complement v c 
and reduce \ m v — > l m v c v. Folding v c into v reduces to l m 2 as before. 

We must be a bit careful with the endgame depending on the value of m" mod p. 
We would like to reduce to a l p ~ k k circle for some k, since l p ~ k k — > l p_fe_1 (fc+l) — s- 
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• • • — > l(p — 1) by successive folds, and then l(p — 1) can be completely folded up, 
producing two orbifold 2 points (much as we completely folded up the 12 circle in 
the case p = 3). 

Fortunately this reduction can be accomplished if to" is sufficiently big (whatever 
its residue mod p). Folding an edge gives -11- — > -2-, but folding at a vertex gives 
■111- — > -2-, in either case producing an interior orbifold point of orders 2 and p 
respectively. By judicious application of some number of these moves (together 
with folds of the kind -ab- — > -(a + b)- if a + b < p or -labl- — » -2- if a + 6 = p) we 
can reduce l™ 1 2 for any sufficiently large to" to l p ~ k k for some fc, and thence glue 
up completely as above. This completes the proof of the theorem. □ 

It seems very plausible that some generalization of our methods should prove 
an analogous theorem for (p, q, oo) orbifolds with p,q > 2, or even arbitrary non- 
compact hyperbolic orbifolds with underlying topological space a disk, but the 
combinatorial endgame becomes progressively more complicated, and we have not 
pursued this. 

4.2. Combinatorics of 12 circles. Returning to the case of p = 3, we describe 
a slightly different method for producing immersed surfaces. Suppose we have 
a word X = R ai L bl R a2 ■ ■ ■ L bn for which the bi can be partitioned into subsets 
Bi = &j,2, • • • } (possibly empty) with the property that Oj » J2j In this 

case, we can choose lifts on and f3j such that for each i, all the j3ij are contained 
"under" the single arc on. 

In this case, we can still produce a surface S' for which d\yS' is a collection of 
circles with 1 and 2 vertices. We are thus naturally led to the question: which 12 
circles can be glued up completely? This seems like a hard combinatorial ques- 
tion; nevertheless, we describe some interesting necessary conditions and sufficient 
conditions (though we don't know a simple condition which is both necessary and 
sufficient). 

Example 4.3. Each 2 vertex must be glued to some unique 1 vertex, therefore the 
total number of 2's must be no more than the total number of l's. So, for example, 
the circles 2 and 221 can't be glued up. 

Example 4.4. The length of a consecutive sequence of l's can never be increased. 
So consecutive sequences of 2's must be associated to disjoint consecutive sequences 
of l's of at least the same length. So, for example, the circle 22211211211 can't be 
glued up, even though it has more l's than 2's. 

Example 4.5. A circle with alternating l's and 2's can be completely glued up. Any 
circle of the form 21 Cl 21 C2 • • • 21 Cm where each Cj > 7 can be completely glued up, 
since we can use the reductions 111- — > -2- or -11- — > -2- to replace -l c - by -121 • • • 21- 
whenever c > 7. Hence in particular, rot is extremal for any conjugacy class of 
the form R ai L bl R a2 L ■ ■ ■ R an L bn whenever the b, can be partitioned into subsets 
Bi = {&i,i, &i,2, • • • } (possibly empty) with the property that ai > 10 + ^ • + 10) 
for all i. 

Given two finite sets of positive numbers {a,} and {bi} (possibly with multiplic- 
ity), the problem of partitioning the bi into subsets Bi = {61,1,61,2, ■ ■■ } (possibly 
empty) with the property that a.j > bij is familiar in computer science, where 
the bi denote the lengths of a family of files, and the denote the lengths of a 
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family of empty consecutive blocks in memory. See e.g. [13], § 2.2 and § 2.5 for 
a discussion. The performance of dynamic memory allocation algorithms is very 
well studied, with respect to many different kinds of statistical distributions for 
the numbers {a^} and {bi}, and it would be intriguing to pursue this connection 
further. 

Example 4.6. In this paper we have discussed various sufficient conditions on the 
exponents a,i and bi for a word 7 to virtually bound an immersed surface. However, 
these conditions have only depended on the sets of values {a{\ and {bi}, and not 
their order. A complete understanding must necessarily take this order into account. 
For example, 

sc\(R 3 LRLR 2 LRL 2 ) = 1/6 = rot{R 3 LRLR 2 LRL 2 )/2 

whereas 

sc\(R 3 LRLRLR 2 L 2 ) = 4/15 > 1/6 = rot(i? 3 LRLRLR 2 L 2 )/2 

5. Experimental results 

In this section we describe the results of some computer experiments, comparing 
the functions scl(-) and rot(-)/2 in general. The function rot can be computed by 
an exponent sum for a conjugacy class expressed as a product of i?'s and L's, and 
scl can be computed using the algorithms described in [4] (implemented on the 
program scallop, available from [7]) and |6j, § 4.2.5. 

5.1. Distribution of n(X). For each word X in L and R, define n(X) to be the 
smallest negative number such that rot is extremal for L~ n X (if one exists), or the 
smallest non-negative number such that rot is extremal for R n X otherwise. 

Figure |8] shows a histogram of the frequency distribution of n(X), for all words 
X of length 10. 




-10 ' ' -5 ' ' ' ' ' ' ' 5 ' ' ' >'l0 
Figure 8. Histogram of freq(n(X)) for all 1024 words X of length 10 

It is a fact ([S]) that in an arbitrary word-hyperbolic group, most rationally 
null- homologous words of length n have scl ~ n/logn. On the other hand, rot is 
an example of a bicombable quasimorphism, and therefore by [8], the distribution 
of values on words of length n satisfies a central limit theorem; in particular, one 
has I rot I ~ y/n for most words of length n. It follows that n(X) is at least of size 
nj logn for most words X of length n, at least when n is large. 
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5.2. Stuttering. One might imagine from the discussion above that if rot is ex- 
tremal for R n X, then it must also be extremal for R m X for all m > n; however, 
this is not the case. We call this phenomenon stuttering. 

Example 5.1. The quasimorphism rot is extremal for R 3 LRL 2 but not for R 4 LRL 2 . 
It is extremal for R 2 LRL 2 RL but not for R 3 LRL 2 RL or R 4 LRL 2 RL. It is extremal 
for RLR 2 L 2 RLRL 2 R 2 L but not for R i LR 2 L 2 RLRL 2 R 2 L for 1< i < 5. We refer to 
these examples colloquially as stuttering sequences of length 1, 2 and 3 respectively. 
We do not know of any examples of stuttering sequences of length > 3, but do not 
know any reason why such examples should not exist. 
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